
DATA TRANSMISSION DEVICE AND METHOD 



BACKGROUND OF THE INVENTION 
- ( 1 ) Field of the Invention 

5 The present invention relates to a data 

transmission device and method for generating a plurality of 
compressed/encoded data of different bit rates from a single 
video signal and simultaneously transmitting the generated 
data onto a network, and more particularly, to a data 
10 transmission device and method applicable to real-time 
transmission of such compressed/encoded data. 
(2 ) Description of the Related Art 

Recent image compression/encoding techniques such 
as MPEG (Moving Picture Experts Group) have made it easy to 
15 deliver moving picture data over networks. However, in the 
case of delivering such data through the Internet in 
particular, the delivered data may possibly be transferred 
via analog telephone lines or ISDN (Integrated Services 
Digital Network) lines, and broadband communication is not 
20 necessarily available to every recipient. Under the present 

t 

circumstances, therefore, it is necessary that the 
3 resolution be lowered or the compression ratio be increased 

to permit data to be delivered at a relatively low bit rate. 

In view of this, a moving picture data delivery 
25 scheme has been conceived wherein two types of data, that is, 
one for delivery to a relatively broadband network, such as 
an intranet in a corporation, and the other for delivery to 
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a relatively narrowband network, such as the Internet, are 
generated from a single video source and are delivered 
simultaneously. For example, for a broadband network, a 
data stream compressed/encoded according to MPEG-2 is 
5 delivered at a bit rate of about 6 Mbps, and for a 
narrowband network, a data stream compressed/encoded 
according to MPEG-4 is delivered at a bit rate of about 
100 kbps. 

Heretofore, when generating a plurality of data 

10 streams for delivery at different bit rates from a single 
video source, encoders equal in number to the data streams 
to be generated are used to encode data distributed from the 
video source. Alternatively, a transcoder or the like is 
used to decode the data stream for broadband delivery and 

15 then to again encode the decoded data stream to obtain a 
data stream for narrowband delivery. 

However, the delivery of moving picture data has 
now become so popularized that there is a strong demand for 
reduction in cost of deliverer-side systems as well as in 

20 size of such systems to save installation space. Also, in 
recent years, real-timeliness or simultaneity of delivered 
data is often given importance especially in cases where the 
water levels of rivers or dams, roads, etc. are monitored 
from a remote location or a conference or a concert is 

25 broadcast live. Accordingly, there has been proposed an 
idea of incorporating a plurality of encoder engines into a 
single encoder, to generate a plurality of data streams of 
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different bit rates and deliver the generated data streams 
simultaneously . 

Meanwhile, in the case of data which has been 
compressed/encoded by using inter-frame prediction as in 
5 MPEG, an appreciable difference often occurs between the 
data amount of a picture which can be decoded by its own 
data only and the data amount of a picture which has been 
generated using the inter-frame prediction. Accordingly, 
the processing load greatly varies during the image 

10 encoding/decoding process, and also when such data is 
transmitted over a network, an actual amount of transmitted 
data can momentarily rise well above the average bit rate. 

As regards techniques for generating data by 
encoding individual objects of image and then multiplexing 

15 the encoded objects, there has been proposed a method in 
which the start timings for encoding objects are offset in 
accordance with the ranges of variations in the amount of 
code generated per frame for the individual objects, to 
smooth variation in the amount of generated code as well as 

20 in the processing load (e.g., Japanese Unexamined Patent 
Application No. H10-023427 (cf. Paragraph Nos . [0037] to 
[0051] , FIG. 5) ) . 

Thus, the compression/encoding techniques using 
the inter-frame prediction as in MPEG are associated with a 

25 problem that the amount of generated code varies over a wide 
range, as mentioned above. Especially in the case where a 
plurality of compressed/encoded data of different bit rates 
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are generated from a single video source and are delivered 
simultaneously, the amount of generated code varies over an 
even wider range, giving rise to a problem that data cannot 
be received properly where the amount of data transmitted 
5 onto the network momentarily increased. 

FIG. 9 illustrates variation in the amount of data 
observed when a plurality of compressed/encoded data of 
different bit rates are simultaneously delivered, wherein 
FIG. 9(A) shows exemplary arrangements of pictures in 

10 respective data streams generated according to MPEG-2, and 
FIG. 9(B) is a graph showing a total amount of data 
generated with respect to each picture. 

In FIG. 9(A) is illustrated the case where two 
data streams A and B of different bit rates are generated 

15 from a single video source by two encoders. A data stream 
encoded according to MPEG-2 (or MPEG-1) comprises an I 
picture encoded in a closed manner within a frame, a P 
picture encoded using forward prediction, and a B picture 
encoded using bidirectional prediction. The data streams A 

20 and B shown in FIG. 9(A) have a general picture arrangement 
in which one I or P picture is preceded and followed by two 
B pictures. A GOP (Group Of Pictures) is a unit that allows 
playback of the data stream in the middle, and one GOP 
always includes one or more I pictures. In the illustrated 

25 example, a fixed number of pictures constitutes one GOP. 

Since the I picture is generated by closed 
encoding within a frame, its data amount is noticeably large, 
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compared especially with the B picture. In a data stream 
having a picture arrangement as shown in FIG. 9 (A) , the data 
amount of I pictures accounts for nearly 1/3 of the total 
amount of the data stream. 
5 In the illustrated example, every twelve pictures 

include one I picture, and therefore, in terms of an average 
data delivery rate per second, half of the amount of data 
delivered during a period of 11/12 second is delivered 
within the remaining period of 1/12 second at a time. Where 

10 the data stream has an average bit rate of 6 Mbps, for 
example, there is a possibility that data is generated at an 
instantaneous rate of 24 Mbps when an I picture is generated. 
Further, since the bit rate of 24 Mbps is a value that 
applies to the case where data is delivered uniformly during 

15 a period of 1/12 second, data can possibly be transmitted at 
an even higher rate if the data is transmitted at a time as 
soon as it is generated. 

Also, in the case where multiple data streams are 
encoded simultaneously by multiple encoder engines 

20 incorporated in a single encoder, the encoding processes are 
usually started at the same time and I, B and P pictures are 
generated at respective identical positions, as shown in FIG. 
9 (A) . Consequently, at the timing when I pictures are 
generated for the respective data streams, the total amount 

25 of generated data sharply increases for a moment. For 
example, where the average bit rates of the data streams A 
and B are 6 Mbps and 3 Mbps, respectively, the data 
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generation rate reaches 36 Mbps (= 24 Mbps + 12 Mbps) at its 
peak. Such extreme unevenness in the amount of generated 
data instantaneously increases the load on the network for 
transmitting data and causes packet loss etc. 
5 It is possible to generate encoded data in such a 

manner as to reduce unevenness in the amount of generated 
data. In this case, however, it is necessary that the 
encoder be provided therein with a large-capacity buffer to 
encode data while temporarily storing a considerable amount 
10 of data. As a result, the transmission of data is delayed 
behind the original video, thus impairing the simultaneity. 
Also, complex control is required, which leads to an 
increase in cost of the device. 

15 SUMMARY OF THE INVENTION 

The present invention was created in view of the 
above circumstances, and an object thereof is to provide a 
data transmission device which permits unevenness in the 
amount of generated data to be reduced with ease in the case 
20 where a plurality of compressed/encoded data of different 
bit rates are generated from a single video signal. 

Another object of the present invention is to 
provide a data transmission method which permits unevenness 
in the amount of generated data to be reduced with ease in 
25 the case where a plurality of compressed/encoded data of 
different bit rates are generated from a single video signal. 

To achieve the first object, there is provided a 
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data transmission device for generating a plurality of 
compressed/encoded data of different bit rates from a single 
video signal and simultaneously transmitting the 
compressed/encoded data onto a network. The data 

5 transmission device comprises a synchronizing signal 
detection section for detecting a synchronizing signal from 
the video signal input thereto, a plurality of 
compressing/encoding sections for compressing/encoding the 
video signal to generate data streams of different bit rates, 

10 respectively, a timing control section for controlling the 
compressing/encoding sections in accordance with the 
detected synchronizing signal such that timings for starting 
compression/encoding processes in the compressing/encoding 
sections are shifted from one another in units of frame, and 

15 a multiplexing section for sequentially multiplexing the 
data streams generated by the respective compressing/ 
encoding sections and transmitting the multiplexed data onto 
the network. 

Also, to achieve the second object, there is 
20 provided a data transmission method for generating a 
plurality of data streams of different bit rates by 
compressing/encoding a single video signal and for 
simultaneously transmitting the data streams onto a network. 
The data transmission method comprises the step of detecting 
25 a synchronizing signal from the input video signal, the step 
of shifting start timings for compression/encoding processes 
corresponding to the generation of the respective data 
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streams from one another in units of frame in accordance 
with the detected synchronizing signal, and the step of 
generating fragmented packets carrying the individual data 
streams in accordance with amounts of data generated per 
5 unit time by the respective compression/encoding processes, 
and transmitting the fragmented packets onto the network at 
equal intervals within the unit time. 

The above and other objects, features and 
advantages of the present invention will become apparent 
10 from the following description when taken in conjunction 
with the accompanying drawings which illustrate preferred 
embodiments of the present invention by way of example. 



BRIEF DESCRIPTION OF THE DRAWINGS 
15 FIGS. 1(A) to 1(C) illustrate the principle of the 

present invention; 

FIG. 2 is a diagram showing a system configuration 
of an image delivery system to which a data transmission 
device of the present invention is applicable; 
20 FIG. 3 is a block diagram illustrating functions 

of a data transmission device according to one embodiment of 
the present invention; 

FIG. 4 is a diagram showing a configuration of a 
signal system for controlling operation start timings for 
25 individual encoders; 

FIGS. 5(A) to 5(E) are a time chart showing 
waveforms of signals sent to the individual encoders from a 
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video timing section; 

FIGS. 6(A) and 6(B) illustrate arrangements of 
pictures in respective generated data streams and amounts of 
generated data; 

5 FIGS. 7(A) and 7(B) illustrate data amounts of 

respective pictures in a generated data stream and packets 
carrying the pictures; 

FIG. 8 is a flowchart illustrating a process 
performed on each of the data streams by a packet generating 
10 section; and 

FIGS. 9(A) and 9(B) illustrate variation in data 
amount observed when a plurality of compressed/encoded data 
of different bit rates are delivered simultaneously. 



15 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Embodiments of the present invention will be 
hereinafter described with reference to the drawings. 

FIG. 1 illustrates the principle of the present 

invention . 

20 A data transmission device 1 of the present 

invention is a device for compressing/encoding a video 
signal from a camera or the like and delivering a generated 
data stream in real time onto a network 2. In this case, 
since networks configured up to recipients can be either 

25 broadband or narrowband, a plurality of data streams of 
different bit rates are generated from a single video source, 
so as to match with the different bands, and are delivered 
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simultaneously. 

As shown in FIG . 1(A), the data transmission 
device 1 comprises a synchronizing signal detection section 
11 for detecting a synchronizing signal from a video signal 
5 input thereto, compressing/encoding sections 12a and 12b for 
compressing/encoding the input video signal to generate data 
streams of different bit rates, respectively, a timing 
control section 13 for controlling the compressing/encoding 
sections 12a and 12b such that timings for starting 

10 compression/encoding processes in the compressing/encoding 
sections 12a and 12b are shifted from each other in units of 
frame, and a multiplexing section 14 for sequentially 
multiplexing the data streams generated by the respective 
compressing/encoding sections 12a and 12b and transmitting 

15 the multiplexed data onto the network 2. The example shown 
in FIG. 1 includes only two compressing/encoding sections, 
but three or more compressing/encoding sections may be 
provided. 

The synchronizing signal detection section 11 
20 detects a synchronizing signal necessary for the detection 
of start timing for a frame or field or horizontal scanning, 
demodulation of chrominance signals, etc., from the input 
video signal. In the case where NTSC (National TV Standards 
Committee) composite signal is input as the video signal, 
25 for example, vertical synchronizing signal, horizontal 
synchronizing signal, color synchronizing (color burst) 
signal, etc. are detected as the synchronizing signal. The 
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detected synchronizing signal is supplied to the timing 
control section 13. 

Using the thus-supplied synchronizing signal, the 
timing control section 13 controls the timings for starting 
5 the compression/encoding processes in the respective 
compressing/encoding sections 12a and 12b. In this case, 
the timings for starting the compression/encoding processes 
in the compressing/encoding sections 12a and 12b are 
controlled so as to be shifted from each other in units of 
10 frame. 

Under the control of the timing control section 13, 
the compressing/encoding sections 12a and 12b compress/ 
encode the input video signal at respective different bit 
rates, to generate moving picture data streams. 

15 The multiplexing section 14 sequentially 

packetizes and multiplexes the data streams generated by the 
compressing/encoding sections 12a and 12b, and transmits the 
multiplexed data onto the network 2. Also, the multiplexing 
section 14 carries out control such that fragmented packets 

20 are generated in accordance with the amount of data 
generated by the individual compressing/encoding sections 
12a and 12b and are transmitted at equal intervals within a 
unit time. 

The compressing/encoding sections 12a and 12b are 
25 provided in the data transmission device 1 correspond in 
number to data streams to be generated and are operated in 
parallel. Thus, although there is a "shift" between the 
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start timings for generating the respective data streams, no 
delay occurs in the subsequently generated data streams, 
making it possible to deliver image with remarkably enhanced 
simultaneity . 

5 Operation of the data transmission device 1 will 

be now described on the assumption that NTSC (National TV 
Standards Committee) composite signal is input as the video 
signal . 

The video signal input to the data transmission 
10 device 1 is supplied to the two compressing/encoding 
sections 12a and 12b as well as to the synchronizing signal 
detection section 11. Each of the compressing/encoding 
sections 12a and 12b receives, as the video signal, a 
digital video signal which has been obtained by processing 
15 the video signal in an A/D conversion section etc., not 
shown, and starts to compress/encode the video signal in 
accordance with a control signal from the timing control 
section 13. 

The compressing/encoding sections 12a and 12b 
20 generate data with different compression ratios and 
different resolutions, but pictures are generated at the 
same intervals. In the case where the compression/encoding 
is performed using inter-frame prediction, the arrangements 
of pictures in the data streams generated by the respective 
25 compressing/encoding sections 12a and 12b are usually the 
same . 

Thus, by shifting the start timings for the 
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compression/encoding processes from each other in units of 
frame, the picture arrangements of the data streams 
generated by the respective compressing/encoding sections 
12a and 12b can be made to differ from each other. As a 
5 consequence, the data amount of generated picture sharply 
increases at different timings between the data streams, 
whereby variation in the overall data amount of the 
generated data streams can be smoothed. 

On the other hand, the synchronizing signal 

10 detection section 11 detects, as the synchronizing signal, a 
vertical synchronizing signal, a color synchronizing signal, 
etc. from the video signal, and outputs the detected signals 
to the timing control section 13. The timing control 
section 13 acquires frame (or field) start timing from the 

15 vertical synchronizing signal input thereto, and also 
generates a chrominance subcarrier signal synchronized with 
the color synchronizing signal. The chrominance subcarrier 
signal is used as a reference signal when separating 
chrominance signals in the input video signal, etc. 

20 In the case of NTSC signal, the phase of the 

chrominance subcarrier signal is inverted every two fields 
(i.e., every frame). Accordingly, the timing control 
section 13 can easily acquire the shift amount corresponding 
to one frame by detecting the phase of the chrominance 

25 subcarrier signal at the frame start timing, as shown in FIG. 
1(B), and can provide the compressing/encoding sections 12a 
and 12b with the start timings for their respective 
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compression/encoding processes. Specifically, when the 
. frame start timing synchronizes with the rise timing of the 
chrominance subcarrier signal, the compressing/encoding 
section 12a is instructed to start the process, and when the 
5 frame start timing thereafter synchronizes with the fall 
timing of the chrominance subcarrier signal, the 
compressing/encoding section 12b is instructed to start the 
process. This serves to smooth variation in the total data 
amount of the generated data streams. 

10 The generated data streams are output to the 

multiplexing section 14. As shown in FIG. 1(C), when the 
amount of data generated per unit time by either of the 
compressing/encoding sections 12a and 12b is too large, the 
multiplexing section 14 generates fragmented packets 

15 carrying such data. For example, a reference value is set 
for the data amount to be carried by one packet, and if the 
amount of data corresponding to one picture (i.e., one 
frame) in either of the data streams exceeds the reference 
value, the picture data is fragmented into a plurality of 

20 packets. The thus-fragmented packets are transmitted at 
equal intervals within a unit time. 

The packet transmission control described above 
makes it possible to further smooth variation in the amount 
of data transmitted to the network 2, to lower the peak bit 

25 rate of transmitting data, and to reduce the transmission 
load. The reference value specifying the data amount to be 
carried by one packet may be set as desired, taking account 
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of the performance of the data transmission device 1 itself 
and the capacity of the network 2. 

As described above, according to the present 
invention, the synchronizing signal detected from the input 
5 video signal is used so that the timings for starting the 
compression/encoding processes in the respective 
compressing/encoding sections 12a and 12b may be shifted 
from each other in units of frame. Consequently, a sudden 
increase in the amount of generated data takes place at 

10 scattered timings in the respective data streams, and since 
variation in the total amount of transmitting data can be 
smoothed as a result, packet loss etc. can be prevented from 
occurring due to increase in the network load. 

Especially in the case where NTSC composite signal 

15 is used as the input, control timings for the compressing/ 
encoding sections 12a and 12b can be derived with ease from 
the vertical synchronizing signal and color synchronizing 
signal detected from the input video signal. It is 
therefore possible to control the start timings for the 

20 compression/encoding processes without the need to use a 
counter circuit etc. for obtaining a shift amount of frame 
start timings, for example, and accordingly, the device can 
be reduced in cost and size. 

Further, in accordance with the data amount of the 

25 data stream generated by each of the compressing/encoding 
sections 12a and 12b, the multiplexing section 14 generates 
fragmented packets carrying the data stream, and transmits 
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the fragmented packets at equal intervals, whereby variation 
in the amount of data transmitted onto the network 2 can be 
made even smoother. In this manner, the present invention 
uses the start timing control for the compression/encoding 
5 processes in combination with the packetization control for 
generated data, whereby the effect of smoothing variation in 
the amount of data transmitted onto the network 2 can be 
enhanced without impairing simultaneity of transmitted data. 

In the example described above, the timings for 

10 starting the compression/encoding processes are shifted from 
each other by one frame, but the start timings may be 
shifted from each other by two or more frames. Also, where 
three or more data streams of different bit rates are 
generated simultaneously, the process start timings of the 

15 respective compressing/encoding sections may be sequentially 
shifted in units of frame. In this case, however, the shift 
amount is preferably set to one frame, in order to more 
securely smooth variation in the amount of generated data. 

An embodiment of the present invention will be now 

20 described in detail. In the following description, the 
invention is applied, by way of example, to a system for 
delivering image to recipients connected to an intranet such 
as a LAN (Local Area Network) in a corporation, or to the 
Internet . 

25 FIG. 2 shows a system configuration of an image 

delivery system to which the data transmission device of the 
present invention can be applied. 
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The image delivery system shown in FIG. 2 
generates data streams from images acquired by cameras 110a 
and 120a and delivers the generated data streams in real 
time to recipients through networks. The image delivery 
5 system is used, for example, to monitor the water levels of 
rivers or dams, roads, etc. from a remote location or to 
broadcast a conference or a concert live. 

The image delivery system comprises data 
transmission devices 110 and 120 to which the cameras 110a 

10 and 120a are connected, respectively, and receiving 
terminals 210 and 220 for receiving moving picture data 
streams delivered from the data transmission devices 110 and 
120. The data transmission devices 110 and 120 are 
connected through an intranet 310 to the Internet 320. The 

15 receiving terminal 210 is connected to the intranet 310, 
while the receiving terminal 220 is connected to the 
Internet 320 through, for example, a telephone line, not 
shown . 

The cameras 110a and 120a each acquire an image 
20 and output the acquired image as NTSC composite signal. The 
data transmission devices 110 and 120 compress/encode the 
image signals from the respective cameras 110a and 120a in 
accordance with MPEG, and transmit the compressed/encoded 
data onto the intranet 310. The receiving terminals 210 and 
25 220 have the function of receiving the thus-transmitted data 
streams through the intranet 310 and the Internet 320, 
respectively, decoding and displaying the data, and each 
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comprise, for example, a computer such as a PC (Personal 
Computer) . 

The system may include larger numbers of data 
transmission devices and receiving terminals than 
5 illustrated. However, the data transmission devices need to 
be connected to the intranet 310. 

The intranet 310 is a network that warrants high- 
speed data transmission/reception as a whole, compared with 
the Internet 320. The data transmission device 110 

10 generates, based on a single video source from the camera 
110a, data streams of different bit rates matching the 
respective transmission bands of the intranet 310 and 
Internet 320, and transmits the generated data streams. 
This is the case with the data transmission device 120. 

15 Accordingly, of the data streams delivered from 

the data transmission device 110, for example, a data stream 
A with a relatively high bit rate of several Mbps can be 
received properly by the receiving terminal 210, and a data 
stream B with a relatively low bit rate of several hundred 

20 kbps can be received properly by the receiving terminal 220, 
as shown in FIG . 2. Each of the data transmission devices 
110 and 120 transmits data by multicasting, for example, and 
the receiving terminals 210 and 220 can select the bit rate 
at which they are to receive the data stream. 

25 Processing functions of the data transmission 

device 110, as an example, will be now described. FIG. 3 is 
a block diagram illustrating the functions of the data 
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transmission device 110. 

As shown in FIG. 3, the data transmission device 
110 comprises an A/D conversion section 111, a sync 
detection section 112, a low-pass filter 113, a Y/C 
5 separation section 114, a filtering/scaling section 115, an 
output formatting section 116, encoders 117a and 117b, a 
video timing section 118, and a packet generating section 
119. 

The A/D conversion section 111 samples the NTSC 

10 composite signal transmitted from the camera 110a and 
converts the signal to a digital video signal. The sync 
detection section 112 detects a vertical synchronizing 
signal, a horizontal synchronizing signal and a color 
synchronizing signal from the digital video signal converted 

15 by the A/D conversion section 111, and outputs the detected 
signals to the video timing section 118. 

The low-pass filter 113 passes only low-frequency 
components of the video signal from the A/D conversion 
section 111, to thereby remove noise. The Y/C separation 

20 section 114 separates the video signal supplied from the 
low-pass filter 113 into a luminance signal and color 
differential signals . 

The filtering/scaling section 115 performs 
resolution conversion and effective image area cropping on 

25 the video signal which has been subjected to the Y/C 
separation. The output formatting section 116 buffers, on a 
frame-by-frame basis, the video signal supplied from the 
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filtering/scaling section 115, and provides a non-interlaced 
video signal. 

The encoders 117a and 117b each receive the video 
signal from the output formatting section 116 and, in 
5 accordance with respective predetermined resolutions and 
compression ratio settings, compress/encode the video signal 
according to MPEG-2 . In this case, the encoder 117a 
generates a data stream A with a relatively high average bit 
rate of several Mbps, for example, and the encoder 117b 

10 generates a data stream B with a relatively low average bit 
rate of several hundred kbps, for example. The encoders 
117a and 117b generate pictures in response to the same 
field start signal from the video timing section 118, as 
described later, and accordingly, no delay of data occurs as 

15 a result of the compression/encoding processes by these 
encoders, thus permitting very nearly real-time generation 
of data with respect to the original video signal. 

The video timing section 118 receives the 
synchronizing signals detected by the sync detection section 

20 112 and controls the operation timings of the encoders 117a 
and 117b, etc. Using the color synchronizing signal from 
the sync detection section 112, the video timing section 118 
generates a 14.318-MHz synchronizing signal, which is a 
reference signal for the synchronization of video signal 

25 within the data transmission device 1. This synchronizing 
signal is used as a sampling frequency by the A/D conversion 
section 111, for example, and also the vertical and 
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horizontal synchronizing signals from the sync detection 
section 112 are synchronized with this synchronizing signal. 
Further, the video timing section 118 generates a 
chrominance subcarrier signal (3.5785 MHz) synchronized with 
5 this synchronizing signal. 

The video timing section 118 generates field and 
frame start signals based on the vertical synchronizing 
signal, as described later, and also outputs the chrominance 
subcarrier signal in order to control the frame start 
10 timings and compression/encoding start timings of the 
individual encoders 117a and 117b by means of these signals. 
A detailed configuration for controlling such operation 
start timings will be described with reference to FIG. 4. 

On receiving the data streams generated by the 
15 respective encoders 117a and 117b, the packet generating 
section 119 assembles the data into IP (Internet Protocol) 
packets, multiplexes the packets, and then sends the 
multiplexed packets onto the intranet 310. 

FIG. 4 illustrates the configuration of a signal 
20 system for controlling the operation start timings of the 
encoders 117a and 117b. 

To control the operation start timings of the 
encoders 117a and 117b, the video timing section 118 outputs 
the field start signal Sfd, frame start signal Sfm and 
25 chrominance subcarrier signal Sc. 

The field start signal Sfd is a pulse signal 
synchronized with rise timing of the vertical synchronizing 
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signal, and provides the encoders 117a and 117b with field 
start timing . 

The frame start signal Sfm is output once for 
every two pulses of the field start signal Sfd and is input 
5 to two AND gates 118a and 118b. The other input terminal of 
the AND gate 118a is input with the chrominance subcarrier 
signal Sc, and the other input terminal of the AND gate 118b 
is input with a phase-inverted signal of the chrominance 
subcarrier signal Sc. Output signals of the AND gates 118a 

10 and 118b are input to the encoders 117a and 117b, 
respectively, thereby providing the encoders 117a and 117b 
with their respective compression/encoding start timings. 

FIG. 5 is a time chart showing the waveforms of 
signals transmitted to the encoders 117a and 117b from the 

15 video timing section 118. 

In an NTSC system, the field period and the 
chrominance subcarrier signal maintain a relationship such 
that they are synchronized once in every four fields, and 
the phase of the chrominance subcarrier signal is inverted 

20 every two fields (i.e., every frame). Accordingly, if 
timing T501 at which a pulse of the frame synchronizing 
signal Sfm is output as shown in FIG. 5(A) coincides with 
rise timing of the chrominance subcarrier signal Sc shown in 
FIG. 5(B), the timing T502 at which the next pulse of the 

25 frame synchronizing signal Sfm is output coincides with fall 
timing of the chrominance subcarrier signal Sc. 

Thus, when the compressing/encoding sections 12a 
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and 12b are instructed to start their processes, the start 
signal Sa is output from the AND gate 118a at the timing 
T501 when the frame start signal Sfm is output for the first 
time after the reception of the instruction, as shown in FIG. 
5 5(C). In response to the start signal Sa, the operation of 
the encoder 117a is started. 

The AND gate 118b is input with the phase-inverted 
signal of the chrominance subcarrier signal Sc, and at the 
next frame start timing T502, rise timings of the phase- 

10 inverted signal and frame start signal Sfm coincide with 
each other, as shown in FIG. 5(D). Thus, the start signal 
Sb is output from the AND gate 118b, as shown in FIG . 5(E), 
and the operation of the encoder 117b is started. As a 
result, the operation start timing of the encoder 117b is 

15 delayed by one frame from that of the encoder 117a. 

In the illustrated example, the frame start signal 
Sfm is used for comparison with the phase of the chrominance 
subcarrier signal Sc, but the field start signal Sfd may be 
used instead. 

20 FIG. 6 illustrates picture arrangements of 

individual data streams and amounts of data generated when 
the operation start timings are controlled in the manner 
described above. 

The data streams A and B generated by the encoders 

25 117a and 117b, respectively, have a picture arrangement such 
that one I or P picture is preceded and followed by two B 
pictures, as shown in FIG. 6(A). Also, in both data streams 
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A and B, one GOP includes the same number of pictures. 

Since the timing for starting the compression/ 
encoding process in the encoder 117b is delayed by one frame 
from the operation start timing of the encoder 117a, the 
5 individual pictures appearing in the generated data streams 
A and B are shifted from each other by one frame, as shown 
in FIG. 6(A). Thus, the I or P pictures of the data streams 
A and B invariably appear at different positions. 

FIG. 6(B) shows the amount of data generated for 

10 each picture in the individual data streams A and B. As 
seen from the figure, in both data streams A and B, the data 
amount of I picture is especially large, compared with the 
data amount of B picture. However, since the I pictures of 
the data streams A and B appear at different positions, 

15 variation in the total amount of data generated per frame in 
the two data streams can be smoothed, compared with the case 
where the I pictures are generated at the same time. 

The timings for starting the respective 
compression/encoding processes are controlled by the video 

20 timing section 118 in the aforementioned manner, whereby 
variation in the total data amount of the data streams 
generated by the encoders 117a and 117b can be smoothed. To 
this end, the video timing section 118 uses control signals 
which are generated based on the synchronizing signals 

25 included in the original video signal. The control signals 
used in this case are the synchronizing signals which are 
generated also in conventional devices to cause the encoders 



- 24 - 



117a and 117b to start processing the digital video signal 
converted from an input analog video signal. Therefore, the 
aforementioned timing control for the compression/encoding 
processes can be implemented by simply adding circuitry 
5 including AND gates, and thus with a simple structure, 
whereby increase in the cost of the device and in the 
installation space therefor can be minimized. 

The following describes how variation in the 
amount of transmitting data is smoothed by the packet 

10 generating section 119. FIG. 7 illustrates the data amounts 
of respective pictures generated in the data stream A or B 
and packets carrying the pictures. 

FIG. 7(A) shows the data amount of each picture in 
the data stream A generated by the encoder 117a, by way of 

15 example. As seen from the figure, the I picture shows an 
extremely large data amount within the data stream, as 
compared with the B or P picture. Thus, the packet 
generating section 119 sets a reference value Dl as an upper 
limit for the data amount to be carried by one packet. In 

20 the figure, D2 indicates a data amount which is twice the 
reference value Dl . 

In the illustrated example, the data amounts of I 
pictures are greater than the reference value Dl, and in 
such cases, the packet generating section 119 fragments the 

25 picture data into a plurality of packets. FIG. 7(B) shows 
the amounts of data carried by respective packets and the 
timings for transmitting the packets. 
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As shown in FIG . 7 (B) , a picture whose data amount 
does not exceed the reference value Dl is contained in one 
packet and transmitted at the frame period. On the other 
hand, for a picture whose data amount is larger than the 
5 reference value Dl, an amount of picture data corresponding 
to the reference value Dl as the upper limit is contained in 
one packet while the remaining data is contained in a 
separate packet (s). In the case of the I pictures shown in 
the figure, their data amount exceeds twice the reference 

10 value Dl, and accordingly, each I picture is fragmented and 
contained in three packets. 

The packets fragmented in this manner are 
sequentially transmitted at respective timings which are 
obtained by equally dividing the period up to the next 

15 packet transmission timing for transmitting the subsequent 
picture by the number of generated or fragmented packets. 
This permits a large amount of I picture data to be 
transmitted in a distributed manner within the transmission 
period allocated to one picture, making it possible to 

20 prevent packet loss etc. from being caused due to sudden 
increase in the transmission load on the network (intranet 
310) . 

FIG. 8 is a flowchart showing a process performed 
on each of the data streams A and B by the packet generating 
25 section 119. 

The packet generating section 119 is provided, for 
example, with buffers for receiving input data from the 
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encoders 117a and 117b, respectively. In Step S801, data 
corresponding to one frame, that is, one picture, is read 
from the buffer, and in Step S802, the data amount of the 
picture thus read out is detected. 
5 Then, in Step S803, the detected data amount is 

compared with the packetization reference value Dl, to 
calculate the number of packets to be generated. If the 
detected data amount exceeds n times the reference value Dl 
and at the same time is smaller than or equal to (n + 1) 

10 times the reference value, (n + 1) is set as the number of 
packets to be generated to carry the data. 

In Step S804, an interval for transmitting the 
fragmented packets is calculated from the calculated number 
of packets. In the case where (n + 1) has been set as the 

15 number of packets as mentioned above, a value obtained by 
dividing 1/30 second, which is the transmission interval 
allocated to one picture, by the number of packets, (n + 1) , 
is set as the interval for transmitting the fragmented 
packets. The packet generating section 119 is also provided 

20 with a timer for counting the packet transmission interval, 
and sets the timer to count the calculated transmission 
interval . 

Subsequently, in Step S805, predetermined header 
information and the like are affixed to the data 
25 corresponding in amount to the reference value Dl to 
generate a first UDP (User Datagram Protocol) packet, and 
transmits the packet at predetermined frame synchronization 
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timing onto the intranet 310. 

In Step S806, it is determined whether or not all 
of the packets corresponding to one picture have been 
transmitted- If all of the packets have been transmitted, 
5 the process for this picture is ended, whereupon the process 
is again executed from Step S801 to transmit the next 
picture. On the other hand, if there is a packet or packets 
which are not transmitted yet, the process proceeds to Step 
S807. 

10 In Step S807, the timer count is monitored to wait 

until a time period corresponding to the transmission 
interval set in Step S804 elapses. Upon lapse of the set 
time period, the process returns to Step S805 to generate 
and transmit the next packet. Steps S805 to S807 are 

15 repeated thereafter until transmission of all packets 
corresponding to one picture is completed. 

The process shown in FIG. 8 is a process performed 
on one data stream. With respect to the packets 

corresponding to the two pictures of the respective data 

20 streams A and B, the packet generating section 119 
continuously transmits the packets at the timing of the same 
frame period. Because of the process start timing control 
carried out by the video timing section 118, an I picture, 
which has a large data amount, is never generated 

25 simultaneously in both data streams A and B. Usually, 
therefore, packet fragmentation is not simultaneously 
carried out in both data streams A and B. Thus, when 
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picture data of the data stream A, for example, is 
fragmented, the packet of the data stream B may be 
continuously transmitted immediately after the first packet 
of the data stream A is transmitted. 
5 The reference value Dl specifying the data amount 

to be carried by one packet can be changed as desired. 
Accordingly, the data transmission amount can be controlled 
appropriately taking account of the performance of the data 
transmission device 110 itself as well as the capacity and 
10 communication state of the intranet 310 to which data is 
transmitted . 

Thus, in the embodiment described above, the 
timings for starting the respective compression/encoding 
processes are controlled by the video timing section 118 

15 while at the same time the transmission is controlled by the 
packet generating section 119 such that fragmented packets 
are generated in accordance with the amount of generated 
data and are transmitted at equal intervals, whereby 
variation in the amount of transmitting data can be easily 

20 smoothed without impairing simultaneity of transmitting data. 

As described above, in the data transmission 
device of the present invention, the timing control section 
controls the timings for starting the compression/encoding 
processes in the respective compressing/encoding sections so 

25 as to be offset in units of frame. Accordingly, the 
compressing/encoding sections generate data streams having 
respective different picture arrangements, and thus, the 
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data amounts of generated pictures increase and decrease at 
different timings in the respective data streams, making it 
possible to smooth variation in the total amount of data 
transmitted to the network. Also, the timing control of the 
5 timing control section is carried out based on synchronizing 
signals detected by the synchronizing signal detection 
section, whereby the device can be simplified in structure. 
Further, the multiplexing section generates fragmented 
packets carrying the individual data streams in accordance 

10 with amounts of data generated per unit time by the 
respective compressing/encoding sections and transmits the 
fragmented packets at equal intervals within the unit time, 
whereby variation in the amount of data transmitted to the 
network can be made even smoother. 

15 The foregoing is considered as illustrative only 

of the principles of the present invention. Further, since 
numerous modifications and changes will readily occur to 
those skilled in the art, it is not desired to limit the 
invention to the exact construction and applications shown 

20 and described, and accordingly, all suitable modifications 
and equivalents may be regarded as falling within the scope 
of the invention in the appended claims and their 
equivalents . 
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